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20 <400> 
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PATENT APPLICATION : US/09/664,186 TIME: 16:20:16 

Input Set : N:\Crf3\RULE60\09664186.raw 
Output Set: N:\CRF3\05242002\I664186.raw 

APPLICANT: Wayne, Jay 
Xu, Shuang-yong 

TITLE OF INVENTION: Method For Construction Of Thermus-E. coli Shuttle 
Vectors And Identification of Two Thermus Plasmid 
Replication Origins 

FILE REFERENCE: Thermus Shuttle Vector 
CURRENT APPLICATION NUMBER: 09/664,186 
CURRENT FILING DATE: 2000-09-18 
PRIOR APPLICATION NUMBER: US/0 9/134 , 246B 
PRIOR FILING DATE: 1998-08-14 
NUMBER OF SEQ ID NOS : 30 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO: 1 
LENGTH: 10 
TYPE: DNA 

ORGANISM: Thermus sp . 
SEQUENCE : 1 

rrcttttyyy 10 
SEQ ID NO: 2 
LENGTH: 7 
TYPE: DNA 

ORGANISM: Thermus sp . 
SEQUENCE: 2 

rrytttg 7 
SEQ ID NO: 3 
LENGTH: 28 
TYPE: DNA 

ORGANISM: Thermus sp . 
"SEQUENCE: 3 

ttaacctttt ttcaagaaaa agagataa 28 
SEQ ID NO: 4 
LENGTH: 1026 
TYPE : DNA 

ORGANISM: Thermus sp . 
FEATURE : 
NAME/KEY : 
LOCATION: 
SEQUENCE : 



CDS 

(1) • • (1023) 
4 

gtg aag aac gaa aaa acc ttc ttt gaa gag ctt tac gag get tta gag 
Met Lys Asn Glu Lys Thr Phe Phe Glu Glu Leu Tyr Glu Ala Leu Glu 

! 5 10 15 

gaa acc cac gac aac acc gat gcc act agg ggg tea gat agg ggg tea 
Glu Thr His Asp Asn Thr Asp Ala Thr Arg Gly Ser Asp Arg Gly Ser 



48 



96 
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50 








20 










25 










30 






144 


51 


gag 


gac 


ttc 


ttc 


ttg 


gee 


acc 


gac 


ccc 


cct 


cca 


gat 


gga 


ggt 


gee 


gaa 


52 


Glu 


Asp 


Phe 


Phe 


Leu 


Ala 


Thr 


Asp 


Pro 


Pro 


Pro 


Asp 


Gly Gly Ala 


Glu 




53 




35 










40 










45 








192 


54 


aat 


cgc 


etc 


gcg 


aag 


ggc 


ttt 


aca 


tac 


caa 


aaa 


gag 


gca 


ctt 


agg 


att 


55 


Asn 


Arg 


Leu 


Ala 


Lys 


Gly 


Phe 


Thr 


Tyr 


Gin 


Lys 


Glu 


Ala 


Leu 


Arg 


He 




56 




50 










55 










60 










240 


57 


get 


tta 


ccc 


gag 


aaa 


gac 


cat 


gag 


get 


ttc 


ctt 


tec 


tct 


gtt 


ggg 


gee 


58 


Ala 


Leu 


Pro 


Glu 


Lys 


Asp 


His 


Glu 


Ala 


Phe 


Leu 


Ser 


Ser Val Gly Ala 




59 


65 










70 










75 










80 


288 


60 


ccc 


cct 


ata 


cca 


cca 


get 


gaa 


ccc 


ccc 


gtt 


ggg 


aat 


gta 


tgt 


caa 


gee 


61 


Pro 


Pro 


He 


Pro 


Pro 


Ala 


Glu 


Pro 


Pro 


Val 


Gly 


Asn 


Val 


Cys 


Gin 


Ala 




62 










85 










90 










95 




336 


63 


gtc 


cag 


gac 


ggg 


cct 


cag 


aag 


ctt 


ctg 


gaa 


etc 


etc 


cag 


gag 


att 


gee 


64 


Val 


Gin 


Asp 


Gly 


Pro 


Gin 


Lys 


Leu 


Leu 


Glu 


Leu 


Leu 


Gin 


Glu 


He 


Ala 




65 






100 










105 










110 






384 


66 


cgc 


tec 


acc 


ate 


ccc 


tac 


ggc 


aac 


egg 


gag 


etc 


tgg 


agg 


aag 


gtg 


ggg 


67 


Arg 


Ser 


Thr 


lie 


Pro Tyr Gly 


Asn 


Arg 


Glu 


Leu 


Trp 


Arg 


Lys 


Val 


Gly 




68 




115 










120 










125 








432 


69 


acg 


gtc 


gtc 


ttc 


atg 


gtc 


ccc 


ctg 


gag 


atg 


ttg 


gee 


etc 


aac 


ctg 


ggg 


70 


Thr 


Val 


Val 


Phe 


Met 


Val 


Pro 


Leu 


Glu 


Met 


Leu 


Ala 


Leu 


Asn 


Leu Gly 




71 




130 










135 










140 










480 


72 


qtc 


acc 


egg 


cag 


acc 


gtc 


cac 


gee 


tgg 


aag 


aag 


gtc 


ctt 


gag 


aaa 


aag 


73 


Val 


Thr 


Arg 


Gin 


Thr 


Val 


His 


Ala 


Trp 


Lys 


Lys 


Val 


Leu 


Glu 


Lys 


Lys 




74 


145 








150 










155 










160 


528 


75 


ggc 


ctg 


gtg 


gee 


acc 


gac 


gtc 


ctt 


cac 


caa 


acc 


gtc 


aac 


ggg 


gag 


cgc 


76 


Gly 


Leu 


Val 


Ala 


Thr 


Asp 


Val 


Leu 


His 


Gin 


Thr 


Val 


Asn Gly Glu Arg 




77 








165 










170 










175 




576 


78 


egg 


gee 


ate 


ggc 


acc 


ctt 


tgg 


gee 


gtc 


egg 


ctg 


agg 


cca 


ggg 


aaa 


gee 


79 


Arg 


Ala 


He 


Gly 


Thr 


Leu 


Trp 


Ala 


Val 


Arg 


Leu 


Arg 


Pro 


Gly Lys 


Ala 




80 






180 










185 










190 






624 


81 


agg 


etc 


acc 


ctg 


gac 


gac 


tac 


ate 


tac 


ccc 


tgg 


agg 


aac 


etc 


gee 


eta 


82 


Ara 


Leu 


Thr 


Leu 


Asp 


Asp 


■Tyr 


He 


Tyr 


Pro 


Trp 


Arg 


Asn 


Leu 


Ala 


Leu 




83 




195 










200 










205 








672 


84 


qac 


atg 


gee 


aac 


ggc 


gtg 


etc 


tec 


ttc 


aac 


tgg 


gtc 


aag 


gee 


tac 


cag 


85 


ASP 


Met 


Ala 


Asn Gly Val 


Leu 


Ser 


Phe 


Asn 


Trp 


Val 


Lys 


Ala 


Tyr 


Gin 




86 


210 










215 










220 










720 


87 


gac 


cac 


gga 


ate 


cgc 


ccc 


acc 


ctg 


gac 


gtg 


ctg 


gtc 


etc 


tgg 


get 


cag 


88 


Asp 


His 


Gly 


He 


Arg 


Pro 


Thr 


Leu 


Asp 


Val 


Leu 


Val 


Leu 


Trp 


Ala 


Gin 




89 


225 










230 










235 










240 


768 


90 


ggg 


aaa 


agg 


gtg 


atg 


ccc 


aac 


acc 


aag 


acc 


gtg 


gee 


gtt 


gac 


ctg 


ggc 


91 


Gly 


Lys 


Arg 


Val 


Met 


Pro 


Asn 


Thr 


Lys 


Thr 


Val 


Ala 


Val 


Asp 


Leu Gly 




92 




245 










250 










255 




816 


93 


etc 


ate 


ctg 


gtc 


etc 


ccc 


gag 


gtg 


gag 


cgt 


tec 


aaa 


etc 


ccg 


gee 


ctt 


94 


Leu 


He 


Leu 


Val 


Leu 


Pro 


Glu 


Val 


Glu 


Arg 


Ser 


Lys 


Leu 


Pro 


Ala 


Leu 




95 








260 










265 










270 






864 


96 


ate 


acc 


etc 


att 


get 


acg 


tac 


att 


gee 


gat 


etc 


eta 


gat 


gac 


cgt 


cgt 


97 


He 


Thr 


Leu 


He 


Ala 


Thr 


Tyr 


He 


Ala 


Asp 


Leu 


Leu 


Asp 


Asp 


Arg 


Arg 




98 






275 










280 










285 
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99 tea aga cgt ttc tat gca ggc ttg ctg tgg get gtg gec agg ggt gaa 912 

100 Ser Arg Arg Phe Tyr Ala Gly Leu Leu Trp Ala Val Ala Arg Gly Glu 

101 290 295 300 

102 etc ccc gcg caa tat eta ttt gee gtc eta a'tg egg gtt ate cga gat 960 

103 Leu Pro Ala Gin Tyr Leu Phe Ala Val Leu Met Arg Val He Arg Asp 

104 305 310 315 320 

105 tac acg gat ggc cat ctg aca cga ccg gga gcg tac eta gtg aag acc 100 

106 Tyr Thr Asp Gly His Leu Thr Arg Pro Gly Ala Tyr Leu Val Lys Thr 

107 325 330 335 

108 etc aag gag gec tec tga 102 

109 Leu Lys Glu Ala Ser 

110 340 



112 <210> SEQ ID NO: 5 

113 <211> LENGTH: 180 

114 <212> TYPE: DNA 

115 <213> ORGANISM: Thermus sp . 

116 <400> SEQUENCE: 5 

117 etataaegge cttttaggag gggggattgc cagccgctgg getgaeggtt attttggacc 6 

118 cataaaaagg cgaaaccgag gcggttgccc cggatcaccc ccaagaccta gggtaacgee 1 

119 tcgggctcca gatgacaagg aggtccgagg gtgaagaacg aaaaaacctt ctttgaagag 1 

121 <210> SEQ ID NO: 6 

122 <211> LENGTH: 5849 

123 <212> TYPE: DNA 

124 <213> ORGANISM: Thermus sp. 

125 <400> SEQUENCE: 6 



126 tctagaaggt cagggtggac aaggaaaaca ccatagcccc tgccaagaag atggacgagt 60 

127 tggtgtccgg aaaagtggcc ateeggggeg ctcttgacaa ctattttcca gcggtggcca 120 

128 ccggcattgg ccacgaggta cgagcttgtg gagtagaegg ccacaaaggg gtcgtcctca 180 

129 aacttctttt ctagtgccgc ttggacgaag gggaggaaga ggaaaggctt catggcctca 240 

130 cctccttccc ctcctccttg geggecttag eggegtaaaa ctctgagacg gectgaagtt 300 

131 tagggatttc getttegggg ataagaatcc ggcggctcag gggatgeegg atggccctta 360 

132 tcctgccgtc ccttatgtac tegtaaatgg tggccttggg tactttaaac cgttctgaaa 420 

133 cttctctaac agagagcaca aaacctctaa aaacctatca atcccaccga ttccagtata 4 80 

134 ccataaatgg cacaaagttt tgagaaggtg gtcaaacaaa aaggctttct eggtcaggtt 540 

135 atggtgaggt gggggcggtc aaaggecgae ttaagtttgg taaagccggg aggaagcaaa 600 

136 ccggggtgtt accatgcaac agatggcega gtggaacgtg tggacacaga gaagcgttga 660 

137 gcttctggag aaggggtatt tggataaact actgeaggtc tataaagggg aaagtggctc 720 

138 ttcgaggtca gtaccagagg aggtagagga aaaacttege gaggectaca aggcatacga 780 

139 ggggaggcag gatagtcegg aggcagaaac gaaactcgtg gaagccgtgc taaatgccag 840 

140 aaaaaaggtc gagcggtccc ccttcaatca cccctacctg cctttggtct actacctggt 900 

141 ttcggaaaaa gcagaaaaag cgaacaaggc ccttgaggag geattgeagg aggttgeetc 960 

142 aaagcaccca gaaaccatcc gcgtcctggc caaggaagcg caaagaagag gegtagaage 1020 

143 cttgatccaa aggctcaagg agcctcccga aataaategg cagatagggc cgatgttcaa 1080 

144 aaggtggtac aaagaagagc taaaggggaa aatagaagag aggcttccag gccctaccaa 1140 

145 accaaagatt gtggtagtat cccctgaaaa aagtaaaccg gagcaagcac cccttattgc 1200 

146 ggagagagaa gegggcatea tcatatacac gggateggat gaagctttga aagatgeege 1260 

147 caaggaaaac ctgggccttg gegaggaage agaactaggc accaagggcg tagatttcta 1320 

148 cgtggtcatc cggcgtagcc ctgaagagac atggcaccta acaggagaag tgaagtttca 1380 

149 * atccgacttt ggeggaaace aagacaacca gaaactagta gcaaaggctt ccataaggtt 1440 
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150 ggaccttgag aagaggcaca taggaatagt ggtggtggac ggaatgcctg tggtgagcaa 1500 

151 gtttcgtggg tgggccggac tggggaaaga aacgatcgtt acatccgtac tcctccttcc 1560 

152 agacctgata gcggagctct accaaaaggg tgaagaagcc ctgggcctct agaaggcgga 1620 

153 cacaatctca aacttgtgct gtagcctggg gaaatcctct aacacccttc tagtgaaggc 1680 

154 tttgaccgcc tcccaggagg catctatgcc gatggatcgc cgctttaaga ggggtgaggc 1740 

155 tataagcgta gtaccggagc ctgcgaaggg atcgagcact aaatccccct cgttactccc 1800 

156 tgtttggacg atgagcttga gcatgtccag atttttctcg gtggggtatc gcgggtacgg 1860 

157 aggatccttg aactgccaaa cgtcctggag cttcttcccc ttcttcagge gatcccgagc 1920 

158 gtaaactttc ttccgcggca ccccgttctt tgaccagaca ataagccctt gagcgtctag 1980 

159 ctcgtcaagc ttctccgggg gatagcgcca atgccgtcca ggagggggaa gtattcctcg 2040 

160 ccaaggcctt ccggtagggc catccttggt ttctccagga gcatgcaggg gattggtggt 2100 

161 gtaccgttcc ccgttctcgt ctacaaaggg gaaaagccta gcgatctcct cttccgaata 2160 

162 ggggctagcc gattcgttcc aaacgtagtc ccgcgttttg gagtagacga ggatcatgtc 2220 

163 cttttgcgat ccgaaggcct tacgggaaaa gtttttggga tttgaagcga tgcgggcgat 2280 

164 atggttaacg aagtttcgcc ggccaaagac ctcatcaagg atgagcttca cctcgaaccc 2340 

165 gtatttctcg tctatgtgaa cgaagatcag tcctgagtcc gccatcagct ccctgagaag 2400 

166 tatcaagcgc tccctcagga actccacaaa ctgaggacca tcgagggtgt catcgtagcc 2460 

167 caactgaccg tttttgggct ggctgacggt agcaacgcga tctgtttcat cgccgccaac 2520 

168 gagaaactgc tggccggttc cataaggcgg gtcaatatag accaactgga ccttccccgc 2580 

169 atacccacca ggctcccgga gcatccaccg gagaacctga ccgttttccc ccaaaaagta 2640 

170 ggtgccaata ggatcaatct caaaaagggg ggcatttccc cctaggaaga ggagggtttc 2700 

171 ttttcgcaaa acaagttgtg gggtgggctg atcaagaatc tccttctcat cgcgttttcc 2760 

172 ggggtagacc aacctaaagg gcgaaggttc cgaggttttc gaggctttca agggggcttt 2820 

173 tcgggtcaaa ccagggtagc tacggctcat tcttccctcc ccacagcgct cttaagcagg 2880 

174 acctcatcac ccacaaccct cacgcactcc aaccaaggaa tccgccaaag gcggcctacc 2940 

175 ttttgagccc gtatcttccc ctgacgtata gaccttcgga tcgtctcagg gtgcacccga 3000 

176 aggatgtctg caagctcctc gggggtcagg tacacgggct tcatcctcat gacacaacct 3060 

177 taccccacag aggacaacac atgcaactat gggcaaagta gacaacgaga ccaaaagctt 3120 

178 gggccactct ctcaggaggc ctccttgagg gtcttcacta ggtacgctcc cggtcgtgtc 3180 

179 agatggccat ccgtgtaatc tcggataacc cgcattagga cggcaaatag atattgcgcg 3240 

180 gggagttcac ccctggccac agcccacagc aagcctgcat agaaacgtct tgaacgacgg 3300 

181 tcatctagga gatcggcaat gtacgtagca atgagggtga taagggccgg gagtttggaa 3360 

182 ■ cgctccacct cggggaggac caggatgagg cccaggtcaa cggccacggt cttggtgttg 3420 

183 ggcatcaccc ttttcccctg agcccagagg accagcacgt ccagggtggg gcggattccg 3480 

184 tggtcctggt aggccttgac ccagttgaag gagagcacgc cgttggccat gtctagggcg 3 54 0 

185 aggttcctcc aggggtagat gtagtcgtcc agggtgagcc tggctttccc tggcctcagc 3600 

186 cggacggccc aaagggtgcc gatggcccgg cgctccccgt tgacggtttg gtgaaggacg 3660 

187 tcggtggcca ccaggccctt tttctcaagg accttcttcc aggcgtggac ggtctgccgg 3720 

188 gtgaccccca ggttgagggc caacatctcc agggggacca tgaagacgac cgtccccacc 3780 

189 ttcctccaga gctcccggtt gccgtagggg atggtggagc gggcaatctc ctggaggagt 3840 

190 tccagaagct tctgaggccc gtcctggacg gcttgacata cattcccaac ggggggttca 3900 

191 gctggtggta tagggggggc cccaacagag gaaaggaaag cctcatggtc tttctcgggt 3960. 

192 aaagcaatcc taagtgcctc tttttggtat gtaaagccct tcgcgaggcg attttcggca 4020 

193 cctccatctg gaggggggtc ggtggccaag aagaagtcct ctgaccccct atctgacccc 4080 

194 ctagtggcat cggtgttgtc gtgggtttcc tctaaagcct cgtaaagctc ttcaaagaag 4140 

195 gttttttcgt tcttcaccct cggacctcct tgtcatctgg agcccgaggc gttaccctag 4 200 

196 gtcttggggg tgatccgggg caaccgcctc ggtttcgcct ttttatgggt ccaaaataac 4260 

197 cgtcagccca gcggctggca atcccccctc ctaaaaggcc gttataggcc ctgctaggag 4 320 

198 gggggtagta ctttcctacc cccctaggct tggagaggcc ttaggaggtc tcctagggcc 4 380 
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199 tcgtgggggt gtaggggtaa cctcatggcc aggccggccg gctcgggact ctggaggagg 4440 

200 cctccatagc ctactcgtgg tggaggtttg tgaaggggtt cactaatgca tacggctagc 4 500 
2 01 ctcgggatca cggccaaatg gtatgcaggt tttggtataa aaccctcagg tttgaggcta 4560 

202 gtttatgtcg gttttatgca cctttgactc ggatcacggg cataaacacc agtttcctgc 4620 

203 acgaaagaaa actttcgcga tctaagaggg ggaaagaggt gtagagggac ggccttcatg 4680 

204 aaagttggcc tcttaggagg ccgttgtaga gggccgtctc gggttcaaat cctttccctc 4740 

205 tctctccagg tttccgaggt tcgaggtctt ggtccaggtc ttgtaccaag tttttgacca 4800 

206 aagtctattc tcggaatata ggggtatctt gtctatcttc cctacgggat atctctgtct 4 860 

207 gtgtgaactt gatcccatcc caatacatat ctcaatctcc taatctcctc ttctctccag 4920 

208 atccctaatc tcttcttcta cctctttctc ctcccaatta agaatggaga ggaaaaaccc 4980 

209 cgaccagaac gagcttctcg gggtcagttt cggtaatctc gggacaggtt ttcatcgtct 5040 

210 aggacgagga ttagggcatg aaaaatgggc tttgacaaaa tctttctaaa aaatactccc 5100 

211 cgaggttggg gaagtgccct cggggagaag atttttggca gtttagatgt tatgctctat 5160 

212 cacgggccgg aggcctccac gataagttgt cttggccaag taccgggcca ggtcgggggt 5220 

213 gctcttcagc gtggtgatgg tactttcacg gaagttcaca agtcctttta gaggcttcag 5280 

214 gtcggggata gtgctcaagt actcccaagc gttctcgggc ccgtggtcgg ggagaaggac 5340 

215 aaaggggtcg ggcaaaagtt catctttgta cttaggacgg attactttag cacctgataa 5400 

216 cttcagggcc gttaagaagg gcctcacctc ggagacgggt ggaaggagga cgtgggcgtg 5460 

217 gaagaagacg aaccccgatt tttgggaagt ctccctccag tttgatgatg aacgttggga 5520 

218 ggaagccggc caggatgtct ttcatcgcgc ctcgaacctc ggacacataa aaaactttcg 5580 

219 tgtttgtcag ggcaagagtg ctatgtatga ggtaaccttc gggagtacaa agtgcctcaa 5640 

220 gccgcctttc ccaacgctcc aaaactctag ggtcaggtgg tttaggtttt ctgaaaaact 5700 

221 ctagcttttc agtggtcatt cctcacccct ctagcacgta ctctggaagg taaacctttg 5760 

222 acacagcggc caagtctagc gtctcccagt ccagttggtc tgggacgcgt gagaagggga 5820 

223 ggggcttggt gtagaggacc agaagaccc 5849 



225 <210> SEQ ID NO: 7 

226 <211> LENGTH: 603 

227 <212> TYPE: DNA 

228 <213> ORGANISM: Thermus sp. 

229 <400> SEQUENCE: 7 



230 atgatcgtgg ctgtcaccgg cttcaaggga ggggtgggga agaccaccac ggcggtccac 60 

231 ctggcctgct tcctggccga gcggggcccc accctgctgg tggacgggga ccccaaccgc 120 

232 tccgccacgg ggtggcaccg gaggggaggc ctcccggtga ccgtggtgga cgagcgggtg 180 

233 gcggcccggt acgcccggga gcacgcccac gtggtcatag acacccaggc ccgccccacg 240 

234 gaagaggacc tccgggccct cgccaagggg gtggacctgc tggtcctgcc cacgtccccc 300 

235 gacgccctgg ccctggaggc cctcctggcc accctggaag ccctgcgggg ggcggaggcc 360 

236 cgcttccggg tcctcctgac catggtgccc ccgcccccga gccgggacgg ggaggaggcc 420 

237 cgggccctct tgggggcgga gggcgttccc ctcttcacag gctgggtgag gcgggcggca 480 

238 gccttcccca aggccgccct cctgggggtg cctgtctacc gggtgcccga ccccagggcg 540 

239 aggctggcct ggggggacta cgcgcgggtg ggggaagagc tcctgaagga ggtgggggga 600 

240 tga 603 



242 <210> SEQ ID NO : 8 

243 <211> LENGTH: 11958 

244 <212> TYPE: DNA 

245 <213> ORGANISM: Thermus sp . 

246 <400> SEQUENCE: 8 

247 cttatacaca caaactatac acgtctctat cgggcttttc ttagcgccat gtaaaacacc 60 
24 8 cctcccatct ccgggtgttt acagcggata cgggaggttc agcgggaact tttccccttg 12 
24 9 ttgaaacttt ggggtctgag gctcaacagc agaacagctt aggttgactc aacacagctc 18 
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